Generic class-based statistical language models for robust speech understanding in directed dialog applications
نویسنده
چکیده
We investigate the usage of class-based statistical language models (SLMs) for robust speech understanding. Generic classbased SLMs are built using data from several applications and then tested on data from a distinct target application to benchmark their portability. The results show that these generic classbased SLMs perform as well as those trained on data from the target testing application. This leads us to conclude that, for directed dialog applications, words that do not fall within a rule (class) are generic across applications. Also, the generic classbased SLMs can be used to automatically transcribe utterances from the target application with high accuracy. These transcriptions are then used to train a word-based SLM; the resulting word-based SLM outperforms the class-based ones.
منابع مشابه
Enhancing commercial grammar-based applications using robust approaches to speech understanding
This paper presents a series of measurements of the accuracy of speech understanding when grammar-based or robust approaches are used. The robust approaches considered here are based on statistical language models (SLMs) with the interpretation being carried out by phrasespotting or robust parsing methods. We propose a simple process to leverage existing grammars and logged utterances to upgrad...
متن کاملCost-level integration of statistical and rule-based dialog managers
Statistical dialog managers can potentially make more robust decisions than their rule-based counterparts, because they can account for uncertainties due to errors in speech recognition and natural language understanding. In practice, however, statistical dialog managers can be difficult to use, as they may require a large number of parameters to be inferred from limited data. Consequently, han...
متن کاملRobust methods in automatic speech recognition and understanding
This paper overviews robust architecture and modeling techniques for automatic speech recognition and understanding. The topics include robust acoustic and language modeling for spontaneous speech recognition, unsupervised adaptation of acoustic and language models, robust architecture for spoken dialogue systems, multi-modal speech recognition, and speech understanding. This paper also discuss...
متن کاملCan Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?
Identifying whether an utterance is a statement, question, greeting, and so forth is integral to effective automatic understanding of natural dialog. Little is known, however, about how such dialog acts (DAs) can be automatically classified in truly natural conversation. This study asks whether current approaches, which use mainly word information, could be improved by adding prosodic informati...
متن کاملThe IBM conversational telephony system for financial applications
We describe our development work on a telephonebased conversational system in the domain of mutual fund transactions. This system uses several components including robust large vocabulary continuous speech recognition, natural language understanding, dialog management, and text-to-speech synthesis technologies.
متن کامل